Speech, music and songs discrimination in the context of handsets variability

نویسندگان

  • Hassan Ezzaidi
  • Jean Rouat
چکیده

The problem of speech, music and music with songs discrimination in telephony with handsets variability is addressed in this paper. Two systems are proposed. The first system uses three Gaussian Mixture Models (GMM) for speech, music and songs respectively. Each GMM comprises 8 Gaussians trained on very short sessions. Twenty six speakers (13 females, 13 males) have been randomly chosen from the SPIDRE corpus. The music were obtained from a large set of data and comprises various styles. For 138 minutes of testing time, a speech discrimination score of 97.9% is obtained when no channel normalization is used. These performance are obtained for a relatively short analysis frame (32ms sliding window, buffering of 100 ms). When using channel normalization, an important score reduction (on the order of 10 to 20%) is observed. The second system has been designed for applications requiring shorter processing times along with shorter training sessions. It is based on an empirical transformation of the ∆MFCC that enhances the dynamical evolution of tonality. It yields in average an acceptable discrimination rate of 90% (speech/music) and 84% (speech, music and songs with music).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech, Music and Songs Discrimination Variability

The problem of speech, music and music with songs discrimination in telephony with handsets variability is addressed in this paper. Two systems are proposed. The first system uses three Gaussian Mixture Models (GMM) for speech, music and songs respectively. Each GMM comprises 8 Gaussians trained on very short sessions. Twenty six speakers (13 females, 13 males) have been randomly chosen from th...

متن کامل

بررسی و تحلیل درون مایه عرفانی در ترانه سرایی

In this essay, the mystical song in songwriting is examined, the oldest Persian poetry has been found among the songs. The song is a general term used to refer to various types of poetry melodies or music accompanying, in particular Fahlaviyat, couplet, quatrain, and folkloric song. Mysticism is one of the most profound thought and philosophical schools that could be expressed itself in our son...

متن کامل

Exaggeration of Language-Specific Rhythms in English and French Children's Songs

The available evidence indicates that the music of a culture reflects the speech rhythm of the prevailing language. The normalized pairwise variability index (nPVI) is a measure of durational contrast between successive events that can be applied to vowels in speech and to notes in music. Music-language parallels may have implications for the acquisition of language and music, but it is unclear...

متن کامل

Correlation between Auditory Spectral Resolution and Speech Perception in Children with Cochlear Implants

Background: Variability in speech performance is a major concern for children with cochlear implants (CIs). Spectral resolution is an important acoustic component in speech perception. Considerable variability and limitations of spectral resolution in children with CIs may lead to individual differences in speech performance. The aim of this study was to assess the correlation between auditory ...

متن کامل

Being Politically Impolite: A Community of Practice (CofP) Analysis of Invective Songs of Western Nigerian Politicians

Earlier linguistic studies of political discourse revealed that, not many works exist on pragmatic analysis of impoliteness in this genre. Apart from Mullany (2002), who employs relational and face works to analyses impoliteness in political discourse, Taiwo (2007), Adetunji (2009), and Ademilokun (2015), who employ discourse analytical tools in analyzing the political speeches, there exist ver...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002